Comprehensive overview of various haplotype block inference methods
نویسنده
چکیده
During last years there has been a breakthrough in genetics and biotechnology. Present technology allows large scale association studies between genotypes and diseases. The most powerful methodology of association studies is the analyse of single nucleotide polymorphisms (SNPs). The estimated number of SNPs in the human genome is 10 million. Therefore, SNP markers provide practical fine-grain map of human genome. Theoretical models and practical experiments suggest that a single strand of DNA is composed from haplotype blocks—sequences of fixed DNA strands. In other words, few suitably chosen SNPs can reveal most of DNA strand and reduce the number of required SNPs for association studies. The following review gives a introduction to basic concepts and models. We analyse pros and cons of various definitions of haplotype blocks, point out advantages and limitations of inference methods. We cover all three main approaches: methods based on marker pairs, combinatorial methods and models with minimal-description length. We also briefly discuss the optimal marker set (OMS) problem that is a common for all methodologies. Good solution can provide high quality of the end results and lower the measurement costs. However, the complete treatment of OMS is out of our scope.
منابع مشابه
یک مدل ریاضی جدید برای مساله استنباط هاپلوتایپها از ژنوتایپها با معیار پارسیمونی
The haplotype inference is one of the most important issues in the field of bioinformatics. It is because of its various applications in the diagnosis and treatment of inherited diseases such as diabetes, Alzheimer's and heart disease, which has provided a competition for researchers in presentation of mathematical models and design of algorithms to solve this problem. Despite the existence of ...
متن کاملHaplotype Block Partitioning and tagSNP Selection under the Perfect Phylogeny Model
Single Nucleotide Polymorphisms (SNPs) are the most usual form of polymorphism in human genome.Analyses of genetic variations have revealed that individual genomes share common SNP-haplotypes. Theparticular pattern of these common variations forms a block-like structure on human genome. In this work,we develop a new method based on the Perfect Phylogeny Model to identify haplo...
متن کاملRobustness of Inference of Haplotype Block Structure
In this report, we examine the validity of the haplotype block concept by comparing block decompositions derived from public data sets by variants of several leading methods of block detection. We first develop a statistical method for assessing the concordance of two block decompositions. We then assess the robustness of inferred haplotype blocks to the specific detection method chosen, to arb...
متن کاملInference of missing SNPs and information quantity measurements for haplotype blocks
MOTIVATION Missing data in genotyping single nucleotide polymorphism (SNP) spots are common. High-throughput genotyping methods usually have a high rate of missing data. For example, the published human chromosome 21 data by Patil et al. contains about 20% missing SNPs. Inferring missing SNPs using the haplotype block structure is promising but difficult because the haplotype block boundaries a...
متن کاملThe effect of single-nucleotide polymorphism marker selection on patterns of haplotype blocks and haplotype frequency estimates.
The definition of haplotype blocks of single-nucleotide polymorphisms (SNPs) has been proposed so that the haplotypes can be used as markers in association studies and to efficiently describe human genetic variation. The International Haplotype Map (HapMap) project to construct a comprehensive catalog of haplotypic variation in humans is underway. However, a number of factors have already been ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009